Finding the Homology of Submanifolds with High Confidence from Random Samples

Authors

  • Partha Niyogi
  • Stephen Smale
  • Shmuel Weinberger
Abstract

Recently there has been a lot of interest in geometrically motivated approaches to data analysis in high dimensional spaces. We consider the case where data is drawn from sampling a probability distribution that has support on or near a submanifold of Euclidean space. We show how to “learn” the homology of the submanifold with high confidence. We discuss an algorithm to do this and provide learning-theoretic complexity bounds. Our bounds are obtained in terms of a condition number that limits the curvature and nearness to self-intersection of the submanifold. We are also able to treat the situation where the data is “noisy” and lies near rather than on the submanifold in question.
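As a rough illustration of the setting described in the abstract, the sketch below samples random points from a simple submanifold of the plane (two disjoint circles) and counts the connected components of the union of small balls centered at the samples. This covers only the zeroth Betti number and is not the authors' algorithm. The function names, the radius `eps`, the sample sizes, and the choice of test manifold are all assumptions made for illustration.

```python
import math
import random


def sample_circle(n, center, radius, rng):
    """Draw n points uniformly at random from a circle in the plane."""
    points = []
    for _ in range(n):
        theta = rng.uniform(0.0, 2.0 * math.pi)
        points.append((center[0] + radius * math.cos(theta),
                       center[1] + radius * math.sin(theta)))
    return points


def count_components(points, eps):
    """Count connected components of the union of closed eps-balls.

    Two closed balls of radius eps intersect exactly when their centers
    are at most 2 * eps apart, so merging such pairs with union-find
    gives the components of the union.
    """
    parent = list(range(len(points)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= 2.0 * eps:
                parent[find(i)] = find(j)

    return len({find(i) for i in range(len(points))})


# Hypothetical example: two unit circles whose centers are 5 apart.
rng = random.Random(0)  # fixed seed, chosen only for repeatability
cloud = (sample_circle(200, (0.0, 0.0), 1.0, rng)
         + sample_circle(200, (5.0, 0.0), 1.0, rng))
print(count_components(cloud, eps=0.3))
```

With samples this dense relative to `eps`, the count should match the number of circles. With too few samples or too small a radius, gaps between neighboring samples would split a circle into several spurious components, which is why bounds on the required sample size matter.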

Similar resources

Identification of Human Chromosome Segments that Have High Homology with Rat Genomic DNA

This study was conducted to determine the location of DNA segments with homology to rat conserved genomic DNA in human chromosomes. Labeled rat genomic DNA was hybridized with normal human (male) metaphases. Analysis of 74 metaphases after fluorescence in situ hybridization showed 371 twin-spot signals on human chromosomes. Statistical analysis indicated that the specific accumulation o...

Reconstructing Functions from Random Samples

From a sufficiently large point sample lying on a compact Riemannian submanifold of Euclidean space, one can construct a simplicial complex which is homotopy-equivalent to that manifold with high confidence. We describe a corresponding result for a Lipschitz-continuous function between two such manifolds. That is, we outline the construction of a simplicial map which recovers the induced maps o...

Sampling from social networks’s graph based on topological properties and bee colony algorithm

In recent years, the sampling problem in massive social network graphs has attracted much attention, since a small, representative sample can be analyzed far faster than a huge network. Many algorithms have been proposed for sampling social network graphs. The purpose of these algorithms is to create a sample that is approximately similar to the original network’s graph in terms of properties such as de...

Distribution Free Confidence Intervals for Quantiles Based on Extreme Order Statistics in a Multi-Sampling Plan

Extended Abstract. Let $X_{i1}, \ldots, X_{in_i}$, $i = 1, \ldots, k$, be independent random samples from the distribution $F^{\alpha_i}$, $i = 1, \ldots, k$, where $F$ is an absolutely continuous distribution function and $\alpha_i > 0$. Also, suppose that these samples are independent. Let $M_{i,n_i}$ and $M'_{i,n_i}$ denote, respectively, the maximum and minimum of the $i$th sa...

Statistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm

This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply a kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality of the distribution function estimator of random variables using the integrated mean square error (IMSE). The results of simulation studies show a significant impro...

Journal:
  • Discrete & Computational Geometry

Volume 39  Issue 

Pages  -

Publication date: 2008